An Image Consulting Framework for Document Analysis of Internet Graphics
نویسندگان
چکیده
A new system approach for image understanding, the image consulting framework, is proposed. It allows for the validation of image properties. Kinds of image properties considered are textual, textural, hierarchically, color and symbolically. Its main application field is information filtering from images used in world wide web documents. The image consulting framework consists of four stages, the color separation stage, the information granulationverification modules (GVMs), the task stage and the recognition stage. On the base of the framework are the GVMs, which are designed to solve very special tasks. They consists of three parts, a method maintainer, a parameter chooser and a tester. The parameter chooser uses a given set of parameter settings for different runs of the maintained method on the input images of the GPM. The resulting images are tested for the occurrence of the propertyfor which the GVM is designed. All succesfull images are put into a queue. The task stages calls new GVMs due to the filling of the queue and it also assigns input images to the GVMs. All fully treated images are passed to the recognition stage, where the information extraction is performed.
منابع مشابه
Directional Stroke Width Transform to Separate Text and Graphics in City Maps
One of the complex documents in the real world is city maps. In these kinds of maps, text labels overlap by graphics with having a variety of fonts and styles in different orientations. Usually, text and graphic colour is not predefined due to various map publishers. In most city maps, text and graphic lines form a single connected component. Moreover, the common regions of text and graphic lin...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملLearning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...
متن کاملDocument Analysis And Classification Based On Passing Window
In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...
متن کاملConnected Component Based Word Spotting on Persian Handwritten image documents
Word spotting is to make searchable unindexed image documents by locating word/words in a doc-ument image, given a query word. This problem is challenging, mainly due to the large numberof word classes with very small inter-class and substantial intra-class distances. In this paper, asegmentation-based word spotting method is presented for multi-writer Persian handwritten doc-...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997